Practice of Epidemiology Strategies for Multiple Imputation in Longitudinal Studies

نویسندگان

  • Michael Spratt
  • James Carpenter
  • Jonathan A. C. Sterne
  • John B. Carlin
  • Jon Heron
  • John Henderson
  • Kate Tilling
چکیده

Multiple imputation is increasingly recommended in epidemiology to adjust for the bias and loss of information that may occur in analyses restricted to study participants with complete data (‘‘complete-case analyses’’). However, little guidance is available on applying the method, including which variables to include in the imputation model and the number of imputations needed. Here, the authors used multiple imputation to analyze the prevalence of wheeze among 81-month-old children in the Avon Longitudinal Study of Parents and Children (Avon, United Kingdom; 1991–1999) and the association of wheeze with gender, maternal asthma, andmaternal smoking. The authors examined how inclusion of different types of variables in the imputation model affected point estimates and precision, and assessed the impact of number of imputations on Monte Carlo variability. Inclusion of variables associated with the outcome in the imputation model increased odds ratios and reduced standard errors. When only 5 or 10 imputations were used, variability due to the imputation procedure was substantial enough to affect conclusions. Careful preliminary analysis identified the scope for multiple imputation to reduce bias and improve efficiency and provided guidance for building the imputation model. When data are missing, such preliminary analyses should be routinely undertaken and reported, regardless of whether multiple imputation is used in the final analysis.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

چند رویکرد برخورد با مقادیر گمشده‌ متغیرهای کمی و بررسی اثر آنها بر نتایج حاصل از یک کارآزمایی‌ بالینی

Background and Objectives: A major challenge that affects the longitudinal studies is the problem of missing data. Missing in the data may result in the loss of part of the information which reduces the accuracy of the estimator and obtain the results will be biased and inaccurate. Therefore, it is necessary to evaluate the missing data mechanism from a longitudinal research and to consider thi...

متن کامل

کاربرد جای گذاری چندگانه در تحقیقات پزشکی و اپیدمیولوژی

Data missing, which occurs for different reasons, is an unavoidable problem in epidemiological studies. It is quite widespread and, therefore, it is considered as a challenge in research design and data analysis by many methodologists. Complete case analysis is often used in studies with missing data however, this approach may result in inaccurate estimates and inferences due to bias associated...

متن کامل

Using multiple imputation to deal with missing data and attrition in longitudinal studies with repeated measures of patient-reported outcomes

OBJECTIVE Missing data is a ubiquitous problem in studies using patient-reported measures, decreasing sample sizes and causing possible bias. In longitudinal studies, special problems relate to attrition and death during follow-up. We describe a methodological approach for the use of multiple imputation (MI) to meet these challenges. METHODS In a cohort of patients treated with percutaneous c...

متن کامل

Practice of Epidemiology Multiple Imputation in a Longitudinal Cohort Study: ACase Study of Sensitivity to Imputation Methods

Multiple imputation has entered mainstream practice for the analysis of incomplete data. We have used it extensively in a large Australian longitudinal cohort study, the Victorian Adolescent Health Cohort Study (1992–2008). Although we have endeavored to follow best practices, there is little published advice on this, and we have not previously examined the extent to which variations in our app...

متن کامل

Influence of Pattern of Missing Data on Performance of Imputation Methods: An Example from National Data on Drug Injection in Prisons

Background Policy makers need models to be able to detect groups at high risk of HIV infection. Incomplete records and dirty data are frequently seen in national data sets. Presence of missing data challenges the practice of model development. Several studies suggested that performance of imputation methods is acceptable when missing rate is moderate. One of the issues which was of less concern...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2010